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Abstract 

We present a model independent method to test the consistency between cosmological measurements of distance and age, assuming 
the distance duality relation. We use type la supernovae, baryon acoustic oscillations, and observational Hubble data, to reconstruct 
the luminosity distance Di{z), the angle averaged distance Dyiz) and the Hubble rate H{z), using Gaussian processes regression 
technique. We obtain estimate of the distance duality relation in the redshift range 0.1 < z < 0.73 and we find no evidence for 
inconsistency between the data sets used. 


1. Introduction 

An explanation for the observed accelerated expansion of the 
Universe still eludes cosmologists. The evidence for the ex¬ 
istence of dark energy (DE), which is believed to source this 
acceleration [1, 2, 3], has been continuously piling since the 
first indications from supernovae (SNela) observations [4, 5]. 
Thanks to the abundance of good quality data cosmology has 
become a precision science. But even in this era of data driven 
cosmology we know little about constituents of our Universe. 
There are several large cosmological surveys that are aiming 
to answer some of the key question in cosmology today, like: 
what is the source of the acceleration of the Universe and what 
are the properties of this mysterious source. Further, there are 
many ambitious projects that are planned for the future [6]. 
In this scenario it is important to check whether the measure¬ 
ments coming from various surveys are consistent with each 
other. This is crucial since multiple probes are often combined 
to put better constraints on cosmological parameters. Consid¬ 
ered together, one data set resolves the difficulties of the other, 
allowing certain degenerate parameters to be determined with 
far greater precision. A consistency check would also help us 
to examine different data sets for the presence of systematics. 

Knox et ah, examined consistency of the different cosmic mi¬ 
crowave background data sets to check whether the data are 
contaminated by some residual non-cosmological signals [7]. 
Avgoustidis et ah, studied the consistency among SNela, mea¬ 
surements of the Hubble parameter, and the baryon acoustic 
oscillation scale. Regarding the tension between the baryon 
acoustic oscillation scale and SNela data in light of possible 
deviations from transparency, they concluded that the source of 
the discrepancy may most likely be found among systematic ef¬ 
fects of the modelling of the low redshift data, or it may be a sta¬ 
tistical fluke [8]. Shafieloo et ah, compared two different probes 
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of the expansion history of the universe, namely, luminosity dis¬ 
tances from SNela and angular diameter distance from galaxy 
clusters. They proposed a model independent method to search 
for inconsistencies between SNela and galaxy cluster data sets 
[9]. More recently, Hazra and Shaheloo studied the consistency 
of the angular power spectrum data from WMAP and Planck 
looking for possible systematics [10]. Cao and Zhu used obser¬ 
vational data with four angular diameter distance measurements 
and synthetic SNela-i-GRB observations for luminosity distance 
data, to investigate the tension between these two cosmological 
distances considering three classes of dark energy equation of 
state reconstruction. They found that the angular diameter dis¬ 
tance measurements and the luminosity distance data are com¬ 
patible at Icr level [11]. Ruiz and Huterer tested the consis¬ 
tency of the standard wCDM model in the framework of Gen¬ 
eral Relativity by separating information between the geometry 
and growth of structure. Using data from SNela, baryon acous¬ 
tic oscillations, the peak locations in the cosmic microwave 
background angular power spectrum, redshift space distortions, 
weak gravitational lensing and the abundance of galaxy clus¬ 
ters, and found that both geometry and growth separately favour 
the ACDM cosmology [12]. 

In this work we look for consistency in distance and age mea¬ 
surement data sets using the distance duality (DD) relation. We 
use baryon acoustic oscillation (BAG) and SNela data for dis¬ 
tance measurements and observational Hubble data as age mea¬ 
surement. The plan of the paper is as follows: we begin with a 
brief overview of DD relation in section 2, in section 3 we dis¬ 
cuss data sets used and a quick over-view of Gaussian Processes 
which is the methodology used here, we discuss our results in 
section 4. 

2. Distance duality relation 

In 1933 Etherington proved the reciprocity relation in area 
distances, between a source and an observer in relative motion 
with each other [13]. This relation is valid for any curved space- 
time, and even basic symmetry assumptions like homogeneity 
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and isotropy are not required. The relation holds as long as 
gravity is described by a metric theory, photons travel on null 
geodesics and the geodesic deviation equation is valid [14, 15]. 
If photon number is conserved this further reduces to a rela¬ 
tion between the angular diameter distance and the luminosity 
distance [16]: 

Di = D^(l+z)l (1) 

This is termed as the DD relation and plays an important role in 
galaxy cluster observations and lensing studies [17]. Since DD 
is crucial to cosmological studies and plays a key role in how 
galaxy observations are analysed, it is important to check its 
validity. Since both the distances in the duality relation are ob¬ 
servable quantities it is possible to test this relation. Now, there 
are many ways in which cosmic distances are measured. One 
can look for sources that can be used as standard candles for 
deriving luminosity distance, and standard rulers can be used to 
derive angular diameter distance. SNela can be used as standard 
candles since they have a peak luminosity that is tightly corre¬ 
lated with the shape of their light curves and hence they can 
be calibrated. On the other hand, combined measurements of 
the Sunyaev-Zeldovich effect and X-Ray analysis (SZE/X-ray) 
provides a measure of the angular diameter distance to a cluster. 
The Baryon acoustic feature in the matter clustering is another 
independent distance indicator and can be used as a standard 
ruler. As observed in previous works ([18, 19]), the constraints 
on the DD obtained from galaxy cluster measurements, depend 
on the assumptions of cluster geometry (spherical or elliptical). 
Hence assuming DD to be true, one can use it as a probe of 
cluster geometry. DD has also been used to constrain the cos¬ 
mic opacity between different redshifts. 

One of the assumptions in DD, is the conservation of pho¬ 
ton number. Hence, the temperature redshift relation, relat¬ 
ing the observed and emitted temperature of the cosmic mi¬ 
crowave background photons, which derives from DD, also as¬ 
sumes photon conservation. The relation will be modified if 
this assumption was violated. There are many mechanisms that 
have been proposed in literature which give rise to such a vio¬ 
lation, for example decaying vacuum cosmology, photon axion 
coupling etc.. In this regard, there have been several attempts 
to measure the cosmic microwave background temperature at 
different redshifts, using, for example, quasar absorption line 
spectra and this can be used to test the validity of the temper¬ 
ature shift relation. But since the uncertainties are large, more 
data is required to put robust constraints. Testing the DD offers 
another way to confirm photon conservation. Assuming there 
are some unclustered sources of photon attenuation in the Uni¬ 
verse, one can use DD to put constraints on the difference in 
opacity between two redshifts. Refer to [ 8 , 20, 21] for details 
of such studies. 

3. Methodology and data sets used 

3.1. Methodology 

As mentioned earlier, we use SNela, BAO and observational 
Hubble data to constrain DD (details of the data sets given in 
3.2). Since SNela are expected to form from standard explosion 


of a white dwarf, they are assumed to have homogeneous light 
curve and uniform luminosity. Although there is an intrinsic 
scatter in the peak luminosities of SNela, an empirical correla¬ 
tion exists between the shape of the SNela light curve and the 
SNela luminosity. Hence these candles can be standardized, 
and are used as standard candles for estimating luminosity dis¬ 
tances. 

Observational Hubble data is obtained from the measurement 
of the relative ages of passively evolving galaxies. The Hubble 
rate depends on the differential age of the Universe as: 


H{z) = - 


1 dz 
1 + zdt’ 


( 2 ) 


and hence the determination of dzjdt gives an estimate of H(z). 
For this one has to look for the variation of ages, Af, with red- 
shift Az. Collection of galaxy samples of passively evolving 
galaxies with high quality spectroscopy, are used and differen¬ 
tial ages for the samples are computed. These are then used as 
estimates of dzidt which eventually gives an estimate of H{z). 

BAO refers to a length scale in the distribution of the pho¬ 
tons and baryons. This scale is imprinted in the matter distri¬ 
bution due to the stalling of sound waves in the plasma of the 
early Universe, and hence they can be treated as cosmological 
standard rulers. Enough BAO data has not been accumulated 
to separately measure the tangential and radial components of 
the signal, and hence the two distances. But it is possible to 
constrain an angle-averaged clustering measurement, obtained 
from the combination of two spatial dimensions orthogonal to 
the line of sight and one dimension along the direction of sight, 
as defined below [ 22 ]: 
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where, is the angular diameter distance, z is the redshift and 
H is the Hubble rate. 

If the DD relation holds, then we know that the luminosity 
distance and the angular diameter distance are related as Di - 
Da{\ + z)^. Let us assume 


Dl 

Da(1+z)2’ 


(4) 


where 77 = 1 if the DD relation holds. Using this we can rewrite 
(3) as 


DdczY'^ 


(5) 


Now, if the distance measurements and the age measurements 
are consistent with each other, and the DD holds then 77 in (5) 
should be equal to 1. A deviation from 1 may imply the break¬ 
down of one or more of the assumptions mentioned earlier or 
it may indicate the presence of some systematic in the data sets 
used. Many authors have studied the DD relation using vari¬ 
ous data sets ([23]). Constraints obtained from using cluster 
data for the angular diameter distance estimate depends on the 
galaxy cluster model assumed. Since we are using BAO data 
for the angular diameter distance estimate our constraints does 
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not contain such biases. Another source of error in the analysis 
of DD is that it is not always possible to obtain a luminosity 
distance estimate and an angular diameter distance estimate at 
the same redshift, and some kind of redshift-matching criteria 
is adopted. In this work we reconstruct all the observable quan¬ 
tities {H{z), Dy{z) and Di{z)) in the redshift range of interest 
using a non-parametric method called Gaussian Processes and 
then estimate 77 using these quantities. 

As mentioned earlier we take Dyiz) measurements from 
BAG data, Di{z) measurements from SNela data, and Hiz) 
measurements from the observational Hubble data obtained 
from differential ages of passively evolving galaxies, to check 
the above relation. The details of the data sets are given in the 
next section. 

3.2. Data 

• To estimate the luminosity distance Di{z) at various red- 
shifts we use the distance modulus measurements from 
SNela Union2.1 sample, as given in [24]. This sample 
contains 580 supernovae and spans 0.015 < z < 1.414. In 
the data set, the distance modulus is given in terms of the 
redshift, and this is used to estimate the luminosity dis¬ 
tance. The relation between the distance modulus yu and 
the luminosity distance Di is: 

A'Bfe) = '«B-A^ii = 51ogio|^^^j+25, (6) 

where Mg is the absolute magnitude of the source and niB 
is the apparent magnitude {B is for B-band). 

• We use eight measurements from BAG data compiled from 
different groups in the redshift range 0.106 < z < 0.73. We 
use data from SDSS (z = 0.2,0.35 and 0.15), 6 dFGS(z = 
0.106), WiggleZ(z = 0.44,0.6,0.73) and BGSS(z = 0.57) 
for estimating the distance [25, 26, 27, 28, 29]. 

• We also use the observational Hubble data as compiled in 
[30]. Note here that we only use those data points which 
are obtained from the analysis of differential ages of galax¬ 
ies. 


which means that the value of f{x) at any point x, is a Gaussian 
random variable with mean ju(x) and covariance k{x, x'): 

fi(x) = E(f(x)), ( 8 ) 

k(x,x') = E((f(x) - fi(x)){f(x') - fi(x'))). (9) 

There are many choices for the covariance functions: squared 
exponential, spline, polynomial etc. Here we chose the com¬ 
monly used squared exponential function for its simplicity. The 
squared exponential covariance function is expressed as: 

k(x, x') ^ crj exp ^ j . ( 10 ) 

This covariance functions is parameterized by the two param¬ 
eters, cTf and / (known as hyperparameters), which represent 
the length scales in the GP. crj controls the variation in f(x) 
relative to the mean and / corresponds to the correlation length 
along which the successive f(x) values are correlated. As de¬ 
sired, the covariance is maximum for variables whose inputs 
are very close which is expected for smooth functions. The ma¬ 
trix elements of the covariance matrix for the GP: K{X, X), are 
given by 

[K(X,X)],-, = k(v,-,v,). (11) 

Similar to the function f{x), the data y can also be represented 
using GP: 

y ~ GP(fi{x),k(x,x')), (12) 

Now, given a set of inputs X (also called training vectors), out¬ 
puts y (the data set, also called target), and the covariance ma¬ 
trix K(X, X), our aim is to make inference about the function 
f(x) at some other points X. The joint probability distribution 
for the data y and the reconstructed function f is given by 



p 

K(X,X)-hC 

K(X,X)1\ 

]f) \ 

p 

K(X,X) 

K(X,X)Jj 


where fi and pi are the assumed means (initial guesses) and C 
is the covariance matrix of the data, which is diagonal if the 
data points are uncorrelated. After some matrix algebra one 
can rewrite the joint probability distribution as (please see [31] 
or [32] for more details of the calculations) 


3.3. Gaussian Processes 

A Gaussian Process (GP) is a collection of random variables, 
any finite number of which have a joint Gaussian distribution 
[31]. Just like a Gaussian distribution is a distribution of a ran¬ 
dom variable (characterized by a mean and a covariance), a GP 
is a distribution over functions. It is characterized by a mean 
function and a covariance matrix. In a regression analysis the 
aim is to infer the relation between independent and dependent 
variables, given some set of observations. In parametric regres¬ 
sion we assume some functional relation between the output 
and the input f{x, 0), where 0 represents the set of model pa¬ 
rameters, and regression requires finding the values of 0 which 
best describe the data. Usually, the chi-squared merit function 
is minimized to obtain the best fit parameters. Similarly in GP 
regression, the function f{x) is represented as 


(7) 


^j;A^exp[-i(t-a)^A--(f-a)], 


where 


a = // + K(X, X)^(K(X, X) + C)-' (y - //), (13) 

A = K(X, X) - K(X, X)^(K(X, X) H- C)^'K(X, X) (14) 

and 

Xii =K(X,X)-i-C. (15) 

Here p and q are the number of points in X and X respectively. 
The marginal distribution of y is given by 

Piy) - fp(y,f)df = (2;r)P/4iil^^^ exp[-l(y-/r)^i:r/(y-//)], 

(16) 


f(x) ~ GP(p(x), k(x, x')). 


3 











1.2 


and the conditional distribution /’(f|y) is 

P(f|y) = —— exp[--(f - a)^A“'(f - a)], 

Piy) (27r)9/2|A|i/2 ^^2 

which implies that the reconstructed function f(X) has a Gaus¬ 
sian normal distribution given by 


f ~ GP(a,A). 


(17) 


Here crf and I are unknown parameters of the GP and training of 
a GP involves selecting appropriate values for these parameters. 
This is usually done by maximising the marginal log-likelihood 
probability InP(y) (from (16)) 


InP(y) = -i(y -//)^[K(X,X) + C]-'(y -//) 

- iln|K(X,X)H-C|-|ln27r. (18) 

Further, to do the analysis one needs to specify an input mean 
function. We chose the initial mean function for all quanti¬ 
ties to correspond to a flat ACDM model with - 0.3, and 
Hq - 70 Km/sec/Mpc. The means are adjusted during the anal¬ 
ysis and replaced by posterior means suggested by preliminary 
runs. Our redshift range of interest is 0.1 < z < 0.73 (governed 
by the BAO data range). We divide this redshift range into in¬ 
tervals of Az = 0.001. This gives us 631 target points (z values 
at which the functions are reconstructed). We then reconstruct 
H(z), Di(z) and Dyiz) at these points using GP regression. This 
implies that for each z target point we have a mean and a vari¬ 
ance of the reconstructed functions given by (13) and (14) re¬ 
spectively. Note that maximising (18) is an approximation, and 
can be used if the posterior for 8, is fairly well peaked. Since 
this is not always guaranteed, in our analysis, we sample the 
hyperparameter space and the probability distributions of the 
reconstructed function are weighted by the posterior distribu¬ 
tions of the hyperparameters (in effect we marginalize over the 
hyperparameters). Samples from these weighted distributions 
of H{z), Diiz) and Z)y(z) are used to eventually construct rj{z) 
as given in (5). Note here that rjiz) itself is not a GP, i.e. it is not 
constructed with GP methodology. It is derived using (5) and 
the errors on r](z) are obtained using error propagation (includ¬ 
ing covariances at different redshifts). 


4. Results and Discussion 

In this paper we have used DD relation as a check for consis¬ 
tency between different data sets. The main result of the paper 
is summarised in figure 1 , where we plot the variation of rj with 
redshift, as estimated from the GP reconstruction of the lumi¬ 
nosity distance, angle averaged distance and the Hubble rate. 
The solid blue line is the best fit curve and the black curves 
around it represent the confidence intervals. The red horizontal 
line is the value of rj when DD holds (and all the data sets are 
consistent with each other). Note that the error bars in this plot 
should be understood as point by point along the redshift. The 
estimates of the three reconstructed quantities {H{z), Dl{z) and 
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Figure 1; Reconstructed 77 (solid blue curve) as function of z. 
The dashed, dotted and dot-dashed black curve represent 1, 2 
and 3cr confidence levels respectively. Solid red line represents 
7 = 1 


Dviz)) have correlations at different redshifts and so will the 
estimates of t]{z), but these correlations are not visible in this 
plot. 

We observe that the DD holds within 3cr confidence level. 
These model independent constraints are better than our pre¬ 
vious constraints on rj ([ 21 ]), where we took some simple 
parametrization for 77 and fixed H{z) assuming ACDM cosmol¬ 
ogy. Further the mean value of 77 is slightly less than unity. This 
is similar to results obtained in earlier works in this direction. 
Uzan et al. [15], found a best fit value for 77 which was slightly 
less than one and related this trend to the systematics in the 
SZE/X-ray analysis of galaxy clusters, assuming ACDM cos¬ 
mology. Bassett and Kunz [14], in their three parameter model 
of DD violation, also found that the SNela sample they used 
were brighter relative to their cIa data,. Gravitational lensing of 
the high-z supernovae was suggested as a possible explanation. 
Nesseris and Garcia-Bellido recently used Genetic Algorithm 
approach to extract model independent and bias- free recon¬ 
struction information from SNela, BAO and the growth rate of 
matter perturbations [33]. Our result is moderately consistent 
with their analysis, but we do not recover the dip in 77 in the 
range 0.3 < z < 0.7, that they obtain in their reconstruction. 77 
can also be assumed to be a constant (other than unity) and the 
value of the constant can be estimated from observations, see 
for example one of our previous works [21], Uzan et al., [15] or 
Bernadis et al [34]. 

Our model independent method can also be used to test DD 
relation if the data sets are known to be consistent with each 
other. If this relation is found to be inconsistent with observa¬ 
tions, it would be a major problem for observational cosmol¬ 
ogy, since the optical theorem that relates surface brightness 
of an object at the source and observer, and the temperature 
shift relation of the cosmic microwave background are derived 
from this relation ([17]). In the event that the DD relation is 
not valid, these key relations in cosmology would have to be 
modified. Future surveys (especially the increase in BAO data 
points) would better constrain 77 and this method can be used to 
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look for the presence of systematics within the data sets. 
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